Deep Reactive Policies for Planning in Stochastic Nonlinear Domains

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Reactive Policies for Probabilistic Planning Domains

We present a planning system for selecting policies in probabilistic planning domains. Our system is based on a variant of approximate policy iteration that combines inductive machine learning and simulation to perform policy improvement. Given a planning domain, the system iteratively improves the best policy found so far until no more improvement is observed or a time limit is exceeded. Thoug...

متن کامل

Decomposition Techniques for Planning in Stochastic Domains Decomposition Techniques for Planning in Stochastic Domains

This paper is concerned with modeling planning problems involving uncertainty as discrete-time, nite-state stochastic automata. Solving planning problems is reduced to computing policies for Markov decision processes. Classical methods for solving Markov decision processes cannot cope with the size of the state spaces for typical problems encountered in practice. As an alternative, we investiga...

متن کامل

Decomposition Techniques for Planning in Stochastic Domains

This paper is concerned with modeling p lann ing problems invo lv ing uncerta inty as d iscre tet ime, f in i te -s ta le stochastic au toma ta So lv ing p l ann ing problems is reduced to comp u t i n g policies for Markov decision processes Classical methods for solv ing Markov decision processes cannot cope w i t h the size of the state spaces for typ ica l problems encountered in pract ice ...

متن کامل

Discrepancy Search with Reactive Policies for Planning

We consider a novel use of mostly-correct reactive policies. In classical planning, reactive policy learning approaches could find good policies from solved trajectories of small problems and such policies have been successfully applied to larger problems of the target domains. Often, due to the inductive nature, the learned reactive policies are mostly correct but commit errors on some portion...

متن کامل

Reactive Policies with Planning for Action Languages

We describe a representation in a high-level transition system for policies that express a reactive behavior for the agent. We consider a target decision component that figures out what to do next and an (online) planning capability to compute the plans needed to reach these targets. Our representation allows one to analyze the flow of executing the given reactive policy, and to determine wheth...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the AAAI Conference on Artificial Intelligence

سال: 2019

ISSN: 2374-3468,2159-5399

DOI: 10.1609/aaai.v33i01.33017530